#free token slots14/11/2025
TiDAR: NVIDIA's Hybrid Diffusion-Autoregressive Design That Multiplies LLM Throughput
NVIDIA's TiDAR combines one-step diffusion drafting with autoregressive verification in a single forward pass to exploit free GPU token slots and multiply tokens-per-forward by up to about 6x while preserving benchmark quality.